A Proposal for Lexical Disambiguation

Authors

  • George A. Miller
  • Daniel A. Teibel
Abstract

A method of sense resolution is proposed that is based on WordNet, an on-line lexical database that incorporates semantic relations (synonymy, antonymy, hyponymy, meronymy, causal and troponymic entailment) as labeled pointers between word senses. With WordNet, it is easy to retrieve sets of semantically related words, a facility that will be used for sense resolution during text processing, as follows. When a word with multiple senses is encountered, one of two procedures will be followed. Either (1) words related in meaning to the alternative senses of the polysemous word will be retrieved; new strings will be derived by substituting these related words into the context of the polysemous word; a large textual corpus will then be searched for these derived strings; and that sense will be chosen that corresponds to the derived string that is found most often in the corpus. Or (2) the context of the polysemous word will be used as a key to search a large corpus; all words found to occur in that context will be noted; WordNet will then be used to estimate the semantic distance from those words to the alternative senses of the polysemous word; and that sense will be chosen that is closest in meaning to other words occurring in the same context. If successful, this procedure could have practical applications to problems of information retrieval, mechanical translation, intelligent tutoring systems, and elsewhere.
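
The two procedures in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the sense labels, the related-word lists, the tiny relation graph, and the four-sentence corpus are toy stand-ins for WordNet and a large textual corpus, and the function names (resolve_by_substitution, resolve_by_context, path_length) are hypothetical. Semantic distance is approximated here by shortest path length in the toy graph, which is one possible choice and not necessarily the measure the authors intend.

```python
import re
from collections import Counter, deque

# Toy stand-in for WordNet (assumption): related words for each sense.
RELATED = {
    "bank": {
        "bank#finance": ["lender", "depository"],
        "bank#river": ["shore", "riverside"],
    }
}

# Toy relation graph (assumption; not WordNet's actual structure).
EDGES = [
    ("bank#finance", "financial_institution"),
    ("lender", "financial_institution"),
    ("depository", "financial_institution"),
    ("bank#river", "slope"),
    ("shore", "slope"),
    ("riverside", "slope"),
    ("financial_institution", "entity"),
    ("slope", "entity"),
]
GRAPH = {}
for a, b in EDGES:
    GRAPH.setdefault(a, set()).add(b)
    GRAPH.setdefault(b, set()).add(a)

# Toy stand-in for a large textual corpus (assumption).
CORPUS = (
    "the lender approved the loan . "
    "the depository approved the loan . "
    "the lender approved the mortgage . "
    "we walked along the shore at dusk ."
)


def resolve_by_substitution(word, context, related=RELATED, corpus=CORPUS):
    """Procedure (1): substitute related words into the context and
    pick the sense whose derived strings occur most often in the corpus."""
    counts = Counter()
    for sense, substitutes in related[word].items():
        for substitute in substitutes:
            derived = " ".join(
                substitute if tok == word else tok for tok in context.split()
            )
            counts[sense] += corpus.count(derived)
    return counts.most_common(1)[0][0], dict(counts)


def path_length(graph, start, goal):
    """Breadth-first shortest path length; infinite if unreachable."""
    if start not in graph or goal not in graph:
        return float("inf")
    seen, queue = {start}, deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        if node == goal:
            return dist
        for nxt in graph[node] - seen:
            seen.add(nxt)
            queue.append((nxt, dist + 1))
    return float("inf")


def resolve_by_context(word, context, related=RELATED, corpus=CORPUS):
    """Procedure (2): use the context as a search key, note the words that
    fill the slot in the corpus, and pick the sense closest to them."""
    left, right = re.split(rf"\b{re.escape(word)}\b", context, maxsplit=1)
    pattern = re.escape(left) + r"(\w+)" + re.escape(right)
    fillers = [w for w in re.findall(pattern, corpus) if w != word]
    # With no fillers every total is 0 and the first sense is returned.
    totals = {
        sense: sum(path_length(GRAPH, f, sense) for f in fillers)
        for sense in related[word]
    }
    return min(totals, key=totals.get), totals


if __name__ == "__main__":
    print(resolve_by_substitution("bank", "the bank approved the loan"))
    print(resolve_by_context("bank", "the bank approved the loan"))
```

In this toy setting both procedures select the financial sense for "the bank approved the loan". A realistic version would need a real lexical database, a much larger corpus, and a more careful notion of context than a single exact word slot.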

Similar resources

Qualia Based Lexical Knowledge for the Disambiguation of the Japanese Postposition 'No' (short paper)

This paper proposes the elaboration of the qualia structure of the Generative Lexicon in [5] and the Extended Generative Lexicon theory [3]. My proposal is based on the Japanese genitive postposition no. The Japanese “NP1-no NP2” construction expresses a wider range of relations between two entities than does the English possessive “NP1’s NP2,” such that the Pustejovskian qualia roles encoded i...

A Proposal for Word Sense Disambiguation using Conceptual Distance

This paper presents a method for the resolution of lexical ambiguity and its automatic evaluation over the Brown Corpus. The method relies on the use of the wide-coverage noun taxonomy of WordNet and the notion of conceptual distance among concepts, captured by a Conceptual Density formula developed for this purpose. This fully automatic method requires no hand coding of lexical entries, hand t...

Chinese Base-Phrases Chunking

This paper introduces new definitions of Chinese base phrases and presents a hybrid model to combine Memory-Based Learning method and disambiguation proposal based on lexical information and grammar rules populated from a large corpus for 9 types of Chinese base phrases chunking. Our experiment achieves an accuracy (F-measure) of 93.4%. The significance of the research lies in the fact that it ...

A Hybrid Model for Word Sense Disambiguation in English-Portuguese Machine Translation

We present the proposal for an approach to word sense disambiguation with application in machine translation from English to Brazilian Portuguese. This approach follows a hybrid natural language processing method, that is, a mixture of knowledge and corpus-based approaches. The main innovative feature is the formalism that we intend to use to represent the instances and the background knowledge...

Race-Based Parsing and Syntactic Disambiguation

We present a processing model that integrates some important psychological claims about the human sentence parsing mechanism, namely that processing is influenced by limitations on working memory and by various syntactic preferences. The model uses time-constraint information to resolve conflicting preferences in a psychologically plausible way. The starting point for this proposal is the Sausage...

Production of English Lexical Stress by Persian EFL Learners

This study examines the phonetic properties of lexical stress in English produced by Persian speakers learning English as a foreign language. The four most reliable phonetic correlates of English lexical stress, namely fundamental frequency, duration, intensity, and vowel quality were measured across Persian speakers’ production of the stressed and unstressed syllables of five English disyllabi...

Publication date: 1991